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■ Abstract 

^ I In the classical occupancy scheme with infinitely many boxes, n balls are thrown 

' independently into boxes 1,2,.. ., with probabilities pj, j > 1. We establish approx- 

imations to the distributions of the summary statistics Kn, the number of occupied 
! boxes, and Kn,r, the number of boxes containing exactly r balls, within the fam- 

Ph ' ily of translated Poisson distributions. These are shown to be of ideal order as 

• n — > oo, with respect both to total variation distance and to the approximation of 

, point probabilities. The proof is probabilistic, making use of a translated Poisson 

approximation theorem of Rollin (2005). 
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1 Introduction 



In the classical occupancy scheme with infinitely many boxes, n balls are thrown in- 
5^ \ dependently into boxes 1,2,..., with probability pj of hitting box j, j > 1, where 
Pi > P2 > • • • > and "^JLiPj = 1- The summary statistics Kn, the number of oc- 
cupied boxes, and Kn,r, the number of boxes containing exactly r balls, have been widely 
studied. Central limit theorems were established by Karlin (1967), under a regular vari- 
ation condition, and Dutko (1989) showed that Kn is asymptotically normal, assuming 
only the necessary condition that its variance tends to infinity with n. A full discussion of 
this and many more aspects of the problem can be found in Gnedin et al. (2007); see also 
Barbour & Gnedin (2009), in which multivariate approximation of the Kn^r is treated. 
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As regards the accTiracy of the central hmit approximation, Hwang & Janson (2008) 
show that the point probabihties P[-ft'„ = t] are uniformly approximated by the point 
probabilities of the integer discretization of the normal distribution J\f{ij,n,a^), where 
Hn := E,Kn and cr^ VarX^. The accuracy of their approximation is of order 0(1/(T^), 
provided only that cr^ — > oo as n — > oo. This is the same accuracy as would be expected for 
sums of independent indicator random variables, and is thus a remarkably precise result. 
However, their proof requires long and delicate analysis of the corresponding generating 
functions. The purpose of this paper is to derive their result by purely probabilistic argu- 
ments, to complement their result with a distributional approximation in total variation, 
and to investigate the quantities Kn^r as well. 

The approach that we take begins with the well-known observation that, if the fixed 
value n were replaced by a Poisson distributed random number with mean n, then the 
numbers of balls in the boxes would be independent Poisson random variables. Approx- 
imations of the kind to be discussed would then be immediate, from the theory of sums 
of independent Bernoulli random variables. The essence of the problem lies in the depen- 
dence introduced by fixing n. One way of relaxing this dependence is to disregard the first 
few boxes, for which the result is essentially known, and to use the fact that the number 
of balls falling in the remaining boxes is now random. Indeed, defining jn > 1 in such a 
way that 

> 4n-Mogn > (1.1) 

it is immediate that 

P[Ar,>lforallj<i„-l] > 1-^(1-11^)" > l-r^-^ 

4 log n V n / 

so that, except on a set of probability at most n^^, we have 

in-l 

E = ^" - 1' (1-2) 

i=i 

where Ij := I[Nj > 1]. Furthermore, a simple Poisson approximation argument, due to 
Le Cam (1960) and Michel (1988), can now be used to get a sharp description of the 
distribution of the remaining elements in the sum := X^j>i Ij, since 

dTY{C{Nj, J >Jn),C{Lj, J >Jn)) < Pn ■= ^Pj, 

j>jn 

where {Lj, j > jn) are independent Poisson random variables with means ELj = npf sec 
Barbour & Gnedin (2009, Section 2). This means that the random sequences (Jj, j > jn) 
and {I[Lj > 1], j > jn) can be constructed to be identical, except on a set of probability 
at most Pn, so that, except on a set of probability at most n^"^ + P„, the distribution 
of Kn agrees with that of a sum of independent indicators, the first j„ — 1 of which are 
equal to 1. Hence a discretized central limit theorem and uniform approximation of point 
probabilities follow, using A/'(/i.n, c^) as basis, with accuracies 0((T~^ -|- + Pn) and 
0{an^ + n~^ + Pn) respectively, and analogous results are also true for the statistics Kn,r- 
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The drawback to this very simple approach is that it need not be the case that, for 
instance, Pn = 0{a^^). For example, Karlin's case of regular variation allows the possi- 
bility of having x n^, for any given (3, < P < 1. In such cases, P„ x (n~^ logn)^"'^, 
so that Pn = 0(c7„^) is not true ii /3 > 1/2, and P„ = 0(cr„^) is not true ii /3 > 2/3. To 
get the result of Hwang & Janson (2008), we in general need something sharper. 

Our approach involves a technique analogous to that above, discarding a set of indices 
for which the outcome is essentially known, and using the randomness in the remainder. 
Foregoing the total independence of the above scheme, which costs too much to achieve, 
we instead construct a conditionally independent sequence of Binomial random variables 
within the problem, and use these to provide the necessary refinement. The way in which 
this can be done is described in RoUin (2005). There, and in this paper too, we use 
translations of Poisson distributions as approximations, instead of discretized normal dis- 
tributions, though, to the accuracies being considered, they are equivalent: the translated 
Poisson distribution TP (/x, cr^) is defined to be that of the sum of an integer a and a 
Poisson Po (A)-distributed random variable, with A and a so chosen that a + X — n and 

Using this approach, we are able to prove the following two theorems. We use d^y to 
denote the total variation distance between distributions: 

dTY{P,Q) := sup\P{A)-Q{A)\, 

A 

and d\oc to denote the local distance (point metric) between distributions on the integers: 

dUP,Q) := sup|P{j}-g{j}|. 

We define jo so that 

J] Pj > 1/2 > J2p^=:Po, 
j>jo-l j>jo 

and let no > 3 be such that j„, defined in (1.1), satisfies jn > jo for all n > no, and also 
that no/log^no > 16/Po- 

Theorem 1.1 If := EXn anda^ := VarX„, then 

rfTv(/:(i^n),TP(/i„,o-2)) = O(cr-i); 

ciioc(/:(i^n),TP(/i„,a2)) = Oia-'), 

uniformly in n > Uq. 

Theorem 1.2 For r > 1, setting ^n,r '■— ^K^^j. and cr^ ,. := Var X^ ,., we have 

c?ioc(>C(i^n,r),TP(/i„,^,a^^J) = 0{a~^^), 
uniformly in n > max{no, e''^'^, 2r}. 

RoUin's theorem and our construction are set out in Section 2, together with the 
general scheme of the proofs. The details for the two theorems are then given in Sections 
3 and 4. Some useful technical results are collected in the appendix. 
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2 The basic method 



We begin with tiie following theorem from RoUin (2005). Let W be an integer valued 
random variable, with mean /i and variance a^, and let M be some random element. 
Define 

:= E{W\M); := Var(W^|M); := Var(/iM); 

p2 := E(a^); u"" := Var(a^); U := t-\iim - 1^)- (2.1) 

Of course, a"^ — + r^. 

Theorem 2.1 Suppose that, for some e > 0, 

\E{f{U)-Uf{U)}\ < e\\f\\ (2.2) 

for all bounded functions f with bounded second derivative. Then there exist universal 
constants Ri and R2 such that 

< E{dTy{C{W I M), TP (//M, (Jm))} + + - + 
dUJ^{W),TP{p,a^)) 

1 23 

< E{dU^W I M), TP {pM, al,))} + R2^U + ^ + '-^}- 

Values of the constants are given in RoUin (2005). Note that (2.2) is exactly what has to 
be established for the simplest smooth metric standard normal approximation to C{U), 
using Stein's method. For U a sum of independent random variables, e would typically 
be the Lyapounov ratio, and thus the quantity a~'^T^e would be bounded by an average 
of the ratios of third to second moments of the summands. 

The theorem is useful provided that >C(iy | M) is such that it is well approximated 
for each value of M by the translated Poisson distribution with its mean and variance 
as parameters. This is the case, for instance, for sums of independent Bernoulli random 
variables, as well as for many sums of independent integer valued random variables, as 
noted in RoUin (2005). Here is the result that we shall use in what follows. 

Theorem 2.2 Suppose that C{W \ M) is the distribution of a sum ^j>i IjiM) of inde- 
pendent Bernoulli random variables with probabilities pj{M) such that fiM '■= Sj>iPj(-^) < 
00 a.s.; write a\j := X]j>iPj(^)(l ~ Pj{^'^))j '■— ^^('^m) '^^^ •= Var(o'^^). Suppose 
that < Cp^ for some C < 00. Then there exists universal constants Ci and C2 such 
that 

E{dTv{C{W\M),TP{pM:al))} < ^ + ^; 

P P 
AC -\- '?C 

E{dMW\M),TP{pM,<jl,))} < ^^V^- 

P 
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Proof. Bounds of the form 



dTviCiW \M), TP {fiM,(rl)) < min{Ci(T^\ 1}; 
d,,,mW\M),TF{fXM,cjl)) < min{C2(T^', 1}, (2.3) 

are given in Barbour (2009; Theorems 6.2 and 6.3), with Ci = 4 and C2 = 280. The former 
follows as in Barbour & Cekanavicius (2002, Theorem 3.1), and similar techniques can 
be used to establish the latter; see also RoUin (2005). Then, by Chebyshev's inequality, 
P[(tIj < < ^C/ p^. The bounds follow by taking expectations in (2.3). □ 

We now need to find a suitable collection of conditionally independent Bernoulli ran- 
dom variables. To do so, we start by observing, as before, that it is enough to consider 
indices j > jn in the sums, so we need only consider the distribution of {Nj, j > jn). 
We reahze these random variables in two stages: first, we reahze M :— (Mj, j > Jq) by 
throwing n balls independently into the boxes with indices j > jo, with probability Pj/Pq 
for box j, and then 'thinning' them independently with retention probability Pq, so that, 
conditionally on M, the (A'^-, j > jo) are independent, with A'^- ~ Bi {Mj, Pq). With this 
construction, it remains to evaluate the quantities appearing in RoUin's theorem, and to 
check that we have the right result. More specifically, we need to check that, for some 
constants C, C", C", 

(i) < Cp^; (ii) p2 > CV^ and (iii) e < Cr'^a^, (2.4) 

uniformly in the stated ranges of n, for the random variables Wn '■= Ylj>j„ ^[^3 — 1] 
Wn,r '■= J2j>j„^['^j = r], r > 1. Theorems 1.1 and 1.2 will then follow directly from 
Theorems 2.1 and 2.2. 

The first two inequalities in (2.4) cause no great problems, since they involve only 
variance calculations, though care has to be taken with the correlations in Theorem 1.2, 
because the summands in 

- E(^Ow-i^o)^-^ 

are not monotone functions of the (negatively associated) Mj. The main effort is required 
in evaluating e for the third inequality. We now sketch the structure of this argument, 
leaving the details to the next two sections. 

Take z{l), Z > 0, to be either Bi (Z, Po){[l, 00)} or Bi{l, Po){r}, as appropriate, (zero 
if Z = 0). Then define the quantity U that we wish to address hy U :— X]j>j„ where 

Cj ■■= n^iM,)), yj{l) := z{l)-Q and Y, := r-'y,{M,). (2.5) 

Thus U is a. sum of mean zero, weakly dependent random variables. In order to ap- 
proach (2.2), we begin by writing 

j>jn j>jn l>0 
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where qj{l) := P[M,- = I] and 

U^-^ := r-iJ^y.lMjr)), (2.7) 



and where 



Mj"^) := (M,-„ s>i„,s^i) ~ MN(m; (p,/Po,-, s>jn,s^j)) (2-8) 



is distributed as m balls thrown independently into the boxes with indices (s > jn,s ^ j) 
with probabilities {ps/Poj, s > jn,s ^ j), with Pqj '■— Pq — Pj > 3Po/4. We need to show 
that the expression in (2.6) is close to E{/'(C/)}. 

As a first step, wc use Taylor development to discard all but the constant and linear 
terms in E{/(6'"-" + T~^yj (/))}, establishing that 



E E^^(0%(OW(^i"-'^ +r-^y,(/)) -E/(t/f-')) -r-^y,</)E/'(t/f 

< k,a'T-'\\f"\\. (2.9) 

The next step is to remove the /-dependence in the constant term, replacing ^/j""'-* by 
U^^\ To make the computations, we realize and on the same probability space 

by writing Mj"^-* = Afj""'^ + z'j\ where Afj""'^ and Z^-^ are independent, and distributed 
as mJ""^ in (2.8), with m = n — I and m — I, respectively; and then defining Uj^ 
and C/j"^ as before, using (2.7). Using this representation, we then show that 

(2) 1^"' E E ?.(0%-(0 W(C/f -^^) - E/(C/f )) - E[/'(C/f -'))(C/f - C/f ))]} 

j>jn l>0 

< k2a'T-^f\\. (2.10) 

Although this has introduced a further term E[/'(C/j"'~'^)(C/j"~'^ — C/j"^)] involving I, there 

is simplification because E/(C/j"^) is multiphed by X]z>o — — 0> hence 

drops out. 

We now simplify what is left by showing that 

(3) 1^"' E E^^-(0%-(0{E[/'(C/i"-'^)(C/f-'^ - C/j"))] -E[/'(C/f )]E(C/j"-') - C/f ))} 

j>jn l>0 

< A;3<7V-^||r||. (2.11) 

As a result of this, the quantity E/(?7-"~'^) in (1) has been replaced by a multiple of 
E/'([/f^), with errors of the desired order, which is a useful step in approaching the 
intended goal of E,f'{U). There is also the quantity E/'([/j"~''') appearing in (1), but this 
is easily reduced to one involving only E/'(?7j"''), too: 

(4) |^-'EE^^-(%|(0{IE/'(C/f-'VEmf )}| <ha'r-V\\. (2.12) 

j>jn l>0 
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At this point, we have thus estabhshed that 



Et//(t/)-r-2^«:,E/'(t/f))| < (fc, + ^2 + fcg + fc4)aV-i/"||, (2.13) 

j>jn 



with 



(2.14) 



l>0 



and, for example by taking f{x) — x, 



1 = EU^ = r-'^Yl 



K.; 



j>jn 



In parallel with the above reduction starting from (2.6), we now start with 

^f'iU) = r-'J2K,EnU) = r-'Y:^K,J2q,mf'iUt'^ +r-'yM, (2-15) 

j>jn j>jn l>0 

and make two rather simpler steps, first proving that 

(5) |r-^E'^^E^^-WW(t/f-'^+r-^%(0)-E/'(C/f-'^)}| < ha'r-Vl (2.16) 

j>jn l>0 



and then that 

(6) |^-'E'^^E^^(OW(^f-'VE/'(t/f^)}| < ha\-V\\. 

j>jn l>0 

Putting these two into (2.15), it follows that 

E/'(C/)-r-^ J]«:,E/'(C/f^)| < {h + h)a'r-'\\f"\l 

j>jn 

and combining this with (2.13) yields 

\wiu)-uf{u)}\ < s\\n 

with (7~^T^£ < Ylt=i bounded, as required. 



(2.17) 



(2.18) 



(2.19) 



3 The argument for 

We begin by noting, for future reference, that we have 

,-1 



msixpj < An logn < Po/4 < 1/8; 

j>jn 



np^ < 16n ^log^n < Pq, 



(3.1) 



whenever n > no, and that (3 := (1 — Po/2) > 3/4. We use c and c' to denote generic 
universal constants, not depending on n or the p/s. 



For Kn-, we have £(W„ | M) that of a sum of indicator random variables Ij{M), j > 
with probabihties 

{l-(l-Po)^^} =: .(M,); 
recall (2.5). Hence al[ = Ej>j„ - z{Mj)), and 

j>jn 

Applying Lemma 5.1 (iv) with x = y/l — Pq, and using the fact that np^ < Pq, now 
immediately gives the lower bound 

min{l,npj}, (3-2) 

j>jn 

where Cp — c{y/l — Po)e~'^^°, and c(-) is as in Lemma 5.1. On the other hand, because 
the Nj are negatively associated, 

a' < 5]Var/[iV, >l] = J]{1 - (1 -P.)" < e-^^ min{l, np,}. 

j>jn j>jn j>jn 

It thus follows that > Cpa^, establishing (2.4) (ii). 

For i/^ = Var cr^, we note that aj^ is the difference of the random variables Si{M) :— 
Ej>jS^-Po)^' and S2{M) Ej>j^^- Po?^' , so that < 2(Varsi(M)+Vars2(M)). 
Since (1 — Pq)' is decreasing in Z, we can use the negative association of the M/s to upper 
bound the variances: 

Varsi(M) < ^ Var {(1 - Pq)"^^}; Vars2(M) < ^ Var {(1 - Pq)'^^}. 

j>jn j>jn 

Now both of these quantities can be bounded by using Lemma 5.1 (iv): 

Var{(l-Po)^^} < e-2^"f^min{l,2/3npj}, 

and 

Var{(l-Po)2^^} < e-2^'"^'^min{l,2/3'npj}, 

with /3' := 4 - 6Po + 4Po^ - Pq^. Thus is uniformly bounded, estabhshing (2.4) (i). 

It thus remains to prove that e < Cr'^a'^ for some constant C", and we are finished. To 
do this, we successively verify the inequalities (1) - (6) of Section 2. 

To establish inequality (1), we note that its left hand side is bounded by 

(3-3) 

3>3n l>0 

Now \yj{l)\ < 1, and 

^g,(Oz/|(0 = E{(l-Po)^^0-{E(1-Po)^0', 

1>Q 
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with Mj ~ Bi {n,pj/Po). Prom Lemma 5.1 (iv) with x — 1 — Pq, it follows that 

^qj{l)y%l) < e-2''"^'^min{l,2/?npj}. (3.4) 

Hence, from Lemma 5.4 (i), 

l>0 j>jn 

By (3.3), this proves (1) with h = K^^^'^K 
For inequality (2), we have 

|E{/(C/f )) - /([/]"-')) - /'(C/f -'^)(C/j") - C/j"-'^)}| < llir ||E{(C/f ) - C/f -'))2}. (3.5) 

Now 

r^E{(t/,(") - ty,("-'))^} < E{(Ez«Po(l-Po)<"y}, 

and the collections of random variables {ZjJ , s > jn) and ((1 — Pq) , s > jn) are 
independent, and each is composed of negatively correlated elements. Hence 

< P'o{Y. ^Z?s - Po)<""'})V Pi E E{(4^)nE{(l - Po)^<"'}. 

Now routine calculation gives 

PoEZji^ < /PoPs/Po, < p2E{(zj;V} < 2/p,(l + 2/p,); 

e{(1-Po)<"'} < e-^"-')^'- e{(1-Po)2<"'} < e-2/3("-')f% 

and hence, with crude simplifications, 

T2E{(t/f)-t/f-'V} < 10/2e^'5"Ep^e-2''"''^ < d2e'^"n-V^ (3.6) 

this last using (3.2) and Lemma 5.4 (i), where 5„ := 2p„ and c = lQ{K{2j3 — l)/cp). Hence, 
putting (3.5) and (3.6) into (2), we obtain the bound 

j>jn l>0 

< c'r-Viril exp{5„(3 + w/^o)} E + ^P.)' 

j>jn 

by Lemma 5.1 (ii) and (iii), and this is uniformly of order T~^cr^||/"|| in the stated range 
of n, because 

EPj(l + ^Pj)e~"^^ < Pnil + e-^) and 5n + n5npn < 5Po/4. 

j>jn 
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This establishes inequahty (2). 

For inequahty (3), we begin by writing 



E{(t/f~'^-[/fO/'(f^i )} 

- E(t/f ) - [/f -'))E/'([/f (3.7) 

note that introducing /'(Et/j""^-*) changes nothing, since it is multiphed by a quantity 
with mean zero. The first term we bound by 

lir II -^Var [E(C/f - C/f ^ | m]""'^)] -^Var C/f"'). (3.8) 

Since 



rE(C/j"-^)-C/j")|Mj"-^)) = ^(l-Po)<-'{l-(l-p,Po/Po.)'}, (3.9) 



and since the (M-^ , s > Jn) are negatively associated, it follows that 



r2Var[E(C/f-')-C/f)|Mj"-'))] < 4f ^ p^g-2/3(n-0P. 



for a suitable c. In much the same way, and using Lemma 5.1 (iv), we have 



T^Varf/f-') < J]Var{(l-Po)'"-""} < 2-^ J] np,e-^^("-'>^ < ce''-<j\ 



Hence the first term in (3.7) is bounded by 

CT-'||/"|Ue'^"n-^/V, (3.10) 
for a suitable c. For the second, we replace E/'(t/j"~'^) by E/'(t/j'*^): 

|E(t/f ^ - ?7j""'^){E/'([/f "'^) - E/'(?7j"^)}| < ||/"||E{(t/f ^ - t/f "'^)'}, (3.11) 

which is at most cr~^||/"[|/^e''^"n~^(7^. Putting these bounds into (3.7), it follows that the 
left hand side in (3) is at most 

^^-VW E E ?.(OI%(0|e''"{/n-VV + I'n-'a'} 

3>3n l>0 

< c'r-=^||/"|||n-VV J2 np,e-"P^- + a^}, (3.12) 

j>jn 



10 



by using Lemma 5.1 (ii) and (iii), for suitable constants c and c'. But now 

j>jn 

by Lemma 5.4 (iv), and this, together with (3.12), shows that (3) is satisfied. 
For (4), we use the simple bound 

|E/'(C/f"'^) -E/'(C/f))| < ||/"||E|C/f) -C/f-')| < T-^Z||/"||. (3.13) 
This gives a bound for the left hand side of (4) of 

j>jn l>0 j>jn 

by Lemma 5.4 (i); and hence we have proved (2.13). 

For the remaining two inequalities, we observe that, from (2.14) and (3.4), 

:= max{Kj,0} < 2/5npje-^^"P^ (3.14) 

whereas, from (3.9), 

Kj = \mm{0, Kj}\ < J]] g^- (/)!%■(/) I ^2/p,e-("-'>» < cnp^e-^f^ J]] ^^6""^'% (3.15) 

l>0 S>jn S>jn 

from Lemma 5.1 (ii) and (iii). Hence, for inequality (5), we obtain the bound 

j>jn l>0 j>jn 

< cr-ir II J] np,e-^"^^- < hr-'aV\\, (3.16) 

j>jn 

by Lemma 5.4 (i), for a suitable ^5. For inequality (6), we start from the bound 

j>jn l>0 

j>jn l>0 s>3n j>jn S>jn 

again from (3.9) and Lemma 5.1 (ii), and substituting from (3.14) and (3.15) for |«;j| gives 
at most 

cr-irilE(^^^i)'{^"^""'"''+^""''(E^^^^""'0'} - (3-17) 

j>jn S>in 

by Lemma 5.4 (i) and (iv). Since (3.16) and (3.17) together establish (2.18), we have 
completed the proof of (2.19), and hence of (2.4) (iii), thus proving Theorem 1.1. 
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4 The argument for K^^r 

Fix r > 1. We now require n to satisfy 41ogn > r — 1 and n > 2r. Then, with 
p :— Pj„-i > 4n~^ logn, we have 

j<jn 

< n~^{4:logny~^e''/r\ < c{logny~^n~^, 



since x^e ^ is decreasing in x > s and 41ogn > r — 1. Hence X]j<j„ -^[-^j = J^] = 
except on a set of probabihty of order 0(n~^(logn)^~^), and we can restrict attention to 
Wn,r Ej>j„ = We recall that /?:=(!- Po/2) > 3/4, and that 

Pn < n/4 < 1/8 and npl < Pq, 

whenever n > Uq. The generic constants c and c' are now allowed to depend on r. 

For Kn^r, the distribution C{Wn^r I is that of a sum of indicator random variables 
Ij{M), j > jn, with probabilities 

(^^)Po^(l-Po)^^ z{M,); 

recall (2.5). The argument now runs much as before, but is complicated by the fact 
that z{-) is not monotonic in I. First, we have = Ylj>j„^^i-^j) — Ylj>j„Cjj with 
Q := Bi {n,pj){r}, whence, defining 

j>jn 

it easily follows that 

exp{-np^^ - n"V^} < /^/fir < e""^", (4.1) 

for n > 2r, with both lower and upper estimates uniformly bounded away from zero and 
infinity in the chosen range of n: hence and fir are uniformly of the same order. 
Now 

= J]z(M,)(l-^(M,-)) > J2z{M,){l-Zr), (4.2) 

j>jn j>jn 

where Zr :— max;>r (^)Po(l — Po)'"'' < 1, and hence 

= E(7^ > fi{l-zr). (4.3) 

For 

= Ye^rWn = ^ 5^{P[iV, = iV, = r] - P[iV, = r]P[iV, = r]}, 

j>jn S>jn 

we use Lemma 5.3 to give 

P[Nj = Ns = r]- P[Nj = r]P[iV, = r] < 2er{pj + Ps)e*'P"'P[Nj = r]P[iV, = r], j s, 
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and adding over j and s gives an upper bound of at most 

j>jn S>jn 

For j = s, the total contribution to the variance is at most Ylj>j ^i^j = = /J,. Hence, 
and from (4.3), we have 

(T^ X p2 X // X fir, (4.4) 

where the imphed constants are universal for each r. This shows also that (2.4) (ii) holds. 
For (2.4) (i), we take 

:= Var(<7^) = Var ^(M,)(l - z(M,))) , 

j>3n 

to which we can apply Lemma 5.3, noting that < z(l)(l — z(l)) < (')Po(l ~ -fo)'"'^- For 
j ^ s, this gives 

Cov{;^(M,)(l-^(M,)),^(M,)(l-^(M,))} < c(p,+p,)(n(p,+p,)+2r)(np,)'-(np,)V-"(^^+^'^), 
by Lemma 5.2. Adding over j and s, this gives at most 

c'l J]pj(npj- + 2r)(npj)"e-"^'^ ^(np^V^^^ + PjimY'^"''^' J]^^^^)"^^^'"^'}' 

j>jn S>jn j>jn S>jn 

(4.5) 

and this is at most cPnp>r + KuPnfir, by Lemma 5.4 (iii) and (iv). The terms with j — s 
give at most 

Y^^{z\M,)} < ^E{[(M,)(2.) + (2r)w(M,)w](l-Po)^(^^-'-)} 

< c{(np,)''^ + (np,)'^}e-2^("-'-)f^ (4.6) 

by Lemma 5.1, and because l'^^-^ < (^j)/(2r) + (2r)(r)/(r)- Adding over j, this gives at most 
a contribution of cfir, by Lemma 5.4. Thus we have shown that < ca"^, and (2.4) (i) is 
satisfied. It thus remains to show that e < CT~^a'^, and the proof is accomplished. 

To establish inequality (1), we once again observe that \yj{l)\ ■— \z{l) — 'Ez{Mj)\ < 1, 
and hence, recaUing (3.3), that 

Ir-'WfW Y.^\yAMj)\' < r-'WfW Y.^z\M,) < CT-'WfWfir, 

j>jn j>jn 

as for (4.6); so (1) holds, as required. 

For (2), we recall (3.5). We then note that, for u > r, 

\,^U + t)-z{u)\ = pr\(^yi-P,)-r-^ + ^yi-P,)-+*-r\^ < c(^^^(l-Po)^ (4-7) 
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for c a universal constant. Prom this, it follows that 

Since (xi H — ■ + x^)^ < r{xl + • • • + xf), we can bound T^E(t/j"'' — by considering 

the r different sums separately. 
First, for 

s>jn ^ ' 

using the independence of Zj^^ and Mj^~'''' and Lemma 5.2, and with 5„ = 2p„ as before, 
the off-diagonal terms give at most 

S>jn t>jn 

the last line using Lemma 5.4 (v). The terms with j = s then contribute at most 

S>jn 

using Lemma 5.4 (ii). The contribution to T^E(C/j"^ — t/j""''')^ from this first sum is thus 
no more than cPe^^^'^rT^jlr 

For 0<w<r — 1, we need to find similar bounds for 

Here, the off-diagonal terms contribute at most 

by Lemma 5.4 (v), and the diagonal terms give at most 

S>jn 

Since, in the above, u <r — 1 and Z < n, it follows that 

r^^{uf - C/f < d^e^'^-n-^,. (4.9) 
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Returning to (2), and once again recalling (3.5), we thus have a bound of 



j>jn l>0 j>jn 



j>jn 



from Lemma 5.4 (iii), and this completes the proof of (2). 

For inequality (3), recalling (3.7) and (3.8), we first need to bound the variance 
Var {E(t/f ) - t/f I Mj"-'))}. Now 

rE([/f)-t/f-'^|Mj"'')) = ^E(z(Mj,"))-z(Mjr'^)|Mj."-')) =: ^ <7,(Mjr'^, 



(0 „„j 



where, from (4.7) and the independence of Zj. and 



\9s{t)\ < f^^[,)(^-PoyPS: (4.10) 

but Qs is not non-negative. From Lemmas 5.3 and 5.2, the off-diagonal terms in the 
variance Var{^^>^.^ s^jd^i^j^ '"*)} contribute at most 

cPe^'^" J] J]p,p,(np,)'-(npO''{(p.+Pt)(l+^P.+^Pt)+^~'(l+m)(l+m)+Wt}e-"^^+^*\ 

S>jn t>jn 

and, using Lemma 5.4, this can be bounded by cPe^^^'^rr'^Pnfir- The diagonal terms in 
turn yield at most 

^ Var^,(Mj"-')) < c/^e^"" J]p2^np,)''(l + (np,)0e"'^"^^ < c'/^e^'^-n'^P^, 

a>3n S>jn 

by Lemma 5.4 (iii). Since also /t^ < cn, it follows that 

Var{E(t/f)-C/f-')|Mj"-'))} < cT-H^e^''"n-'Pn. 
For Var C/j""'^ , the considerations are similar but easier, since we now have 

< z{t) < Q (1 - PofP^ 

in place of (4.10), and the contributions from both diagonal and off-diagonal terms are 
bounded by e^^^^pLr- Hence, and recaUing (3.7) and (3.8), we have arrived at a bound 

|E{[E(?7f - f/f ^ I Afj""')) - E(t/j"-') - t/f ^)](/'(t/j""'^) - /'(E[/j"-')))}| 

< CT-^f"\\le'''-./^;KF; (4.11) 
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the analogue of (3.11), 

|E([/f ) - [/f -'^){E/'([/f - E/'([/f^)}| < CT-^\\f"\\ l^e^'^"n-'fir, (4.12) 
follows directly from (4.9). Hence, for (3), we have 

^~'EE^^W%(0{E[/'(^f-'^)(t/f-'^ - t/f )] -E[/'([/f )]E 

j>jn 1>Q 

3>jn 

< c'r-V\\ { J](np,)'-+^(l + np,)e-"^'^}(V;a;::^ + n-^A^^ 
and since 



3>jn 



j>jn 

by Lemma 5.4 (v), we conclude that inequality (3) is indeed satisfied. 
For inequality (4), we use the simple bound in (3.13), obtaining 

^"'EE^^(o^l(ow(^f-'^)-E/'([/f))}| < r-v\\ j2nM,ym)} 

j>jn l>0 j>jn 

< cr-V\\Y^{npjni + {npjY^')e-'^-^^ < c'jlrr-Vl 

j>jn 

from Lemma 5.1 (iii), in much the same way as for (4.6). Hence we have now estab- 
lished (2.13). 

For (5) and (6), we need the constants Kj, for which we now have the bounds 
from (4.6), and 

Kj < cE{M,|y,-(M,-)|e^^^-^"}v^ 
< c'{npjY{l + npj)e'"^'Pi 

from (4.9). For inequality (5), this immediately gives a bound of 

CT-''\\r'\\Y^\K,\{np,Ye--^^ < c'ilrr-V\\, 

j>jn 

using Lemma 5.4 (ii); for (6), we obtain the bound 

j>jn 

where, for the contribution from nj , we again use Lemma 5.4 (v), much as for (4.13). 
This completes the proof of (2.18), and thus of Theorem 1.2. 
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5 Appendix 

We collect several useful calculations, the first two of which need little proof. We write 
777.(5) '■— Tn{m — 1) ... (777 — s + 1). 

Lemma 5.1 //M ~ Bi {m,p), then for any x > and < s <m, 

In particular, if x = , where Q < 5 < 5q <1, and if (1 — P)e^° < 1, then 
(a) E{M(s)X^} < (mpY ex.p{5o(s + mpe)}; 

(Hi) E{M(,)[(1-P)e^]^} < (mp(l-P))"e-("*-")f^exp{5o[s + mpe(l-P)]}. 
Furthermore, for < x < 1 and p < 1/2, we have 

(iv) c{x)e-^"'P' mm{l,mp} < e^'P^^-'^'^Ex^^ - {Ex^f} < mm{l,mp{l - x^)}, 
where c{x) := min{(l — e"^^""^^^), (1 — xj^e"^^""^^^)}. 
Proof. We prove only (iv). Prom (i), we have 

E.--(E.-r - {1 -x-)r{i - (1 - '^'.i^l^sj^ yy 

The upper bound follows immediately, using the fact that 1 — p < 1 — p(l — x^). The 
lower bound 

also uses the fact that p < 1/2, and the argument is completed in standard fashion. □ 

Lemma 5.2 Let {L,M,m-L-M) ~ MN (777; p, q, 1-p-q) be trinomially distributed. 
Then 

E{L(„)M(,)«;^x^} = 777(„+,)(«;p)"(xg)^(l+p(«;-l) + g(x-l))"^-"-^ 

/r7 particular, ifQ<w,x < where 0<6 <6q<1, and if (1 - P)e^'' < 1, then 

E{L(„)M(^)'u;^x^} < {mpYirnqf exp{5o[(^i + + 777(p + g)e]}; 
E{L(„)M(,)[(l-P)eY+''} 

< (777p(l - P))"(777g(l - p))^e-("^-"-^)(P+^)-^ exp{5o[(7i + v) + m{p + q)e{l - P)]}. 
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Lemma 5.3 Let {L,M,m - L - M) ~ MN (m; p,q,l — p — q) be trinomial, where 
p + q < S < 1/4, and let the functions f, g, h, k satisfy < f{l) < h{l) and < g{l) < k{l) 
for / e Z+ . Then 

Cov(/(L),^(M)) < C, 

:= e(p + g){E(L/i(L)e2^^)E(A;(M)e'^'5) +E(/i(L)e'^^)E(MA;(M)e2^'5)}. 

/// and g are not nonnegative, hut \ f\ and \g\ are hounded as ahove, then 

A/fYY 

Gov {f{L),g{M)) < Ci + 2m-^E{Lh{L))E{Mk{M)) + —pqEh{L)Ek{M) . 
Proof. From the multinomial formulae, we have 



f{u)g{v){P[L = u,M = v]-P[L = u]P[M = v]} 

— — —^^^ — P Q {m{u+v)[i- - P - Q) - "^(„)m(^)(i -pj [i--Q) I 

< f{u)g{v)P[L = u]P[M = v]{{l-p-q)-^''+''^ -1} (5.1) 

< h{u)k{v)P[L = u]P[M = v]{p + q){u + v) exp{2(p + g)('u + v + 1)}, 

where the last inequality uses p + q < 1/4. The first part of the lemma now follows. 
For the second part, (5.1) should be replaced by 



\f{u)g{v)\P[L = u]P[M^v] 
||(l-p-g)-("+^)-l| + 



(m - m)(^) 



- 1 



m 



after which we use the bounds 



(m - u){y) 



- 1 



{v) 



< 



2uv 



m 



(v) 



1 - 



+ 



1 - 



pq 



;i-p)(i-g) 



pq 



;i-p)(l-g) 



- 1 



< Ampq/S. 



□ 



Lemma 5.4 Letps, s > j, he nonnegative numhers summing to P <1, and define 
<^l{r) := J](np,)'-e-"^'% r > 1; <7^(0) := Y.'^iTi{npsA)e-^^^ . 

s>j s>j 

Then there exist universal constants Kr'^\ K^, K^v and K' such that, for any integers 
u > V > and for any a > 0, 

(z) ^(np,)"+^e-(i+")"f» < ii'i"V2(0); (zz) ^(np,)'^+V-(^+")'^^'^ < ii'i"V^(r); 

s>j s>j 

{Hi) ^(np,)"+^e-"P^ < KuuP; (iv) (^J] np^e'^^^)' < i^W^(O); 

s>j s>j 
s>j t>j 
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Proof. The first inequality reflects the fact that 

^u+l^-{l+a)x < ^^-x for < X < 1, 

whereas x^'^^e^^-^'^"^''^ < e^^ sup^>^{2;e^"^}: thus we can take K^"'^ = 1/ea. The second is 
similar in vein, but easier. The third inequality, and case u = v = in the fifth, follow 
from 

J]](np,)"+ie-"P^ = nJ2Ps{nPsTe-''P^ < nP{u/eY. 

s>j s>j 

For the fifth with w > 1, we write the sum as 

^ '—^ L r + — 1 J 

s>3 t>j 

and use Cauchy-Schwarz to yield the upper bound 

s>j 

< nP y (np,)^-"^^ max{x''+"+''-^ exp{-x(r + v-l)/{r + u- 1)}}, 

s>j 

noting that r + u — 1 > 1. For the fourth part, Cauchy-Schwarz gives 

(^npsC'P"^ < n^npse"^"^" < ^ minjnps, e"^}e""^^ 

s>j s>j s>j 
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